Heterogeneity, and the secret of the sea 
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This paper explores tools for modeling and measuring the compositional heterogeneity of a rock, 
or other solid specimen. Intuitive "variation per decade" plots, simple expressions for containment 
probability, generalization for familiar error-in-the-mean expressions, and a useful dimensionless 
sample bias coefficient all emerge from the analysis. These calculations have also inspired subsequent 
work on log- log roughness spectroscopy (with applications to scanning probe microscope data), 
and on angular correlation mapping of lattice fringe images (with applications in high resolution 
transmission electron microscopy). It was originally published as Appendix E of a dissertation^ on 
"Microcharacterization of interplanetary dust collected in the earth's stratosphere" . 
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2ND MOMENT STATISTICS: NEXT AFTER 
MEAN 



An intuitively simple, but quantitative, approach to 
spatial heterogeneity is possible using mathematics which 
claims wide familiarity because of its already diverse ap- 
plications. The composition of a specimen can be consid- 
ered a function of spatial position whose statistical prop- 
erties are fully described by a set of underlying probabil- 
ity distributions^. These distributions are fully specified 
by knowledge of their moments. Naturally, the first pa- 
rameters to measure in describing these distributions are 
the first moments: the average values or compositional 
abundances. Elemental and mineral (modal) abundances 
have long been familiar tools in the characterization of 
geological specimens. 

The logical next step in specifying the distributions is 
to use information on point-to-point variations to mea- 
sure the second moments of the probability distributions. 
If, for simplicity and in the interest of improved sam- 
ple statistics, the underlying distributions are consid- 
ered invariant under translations and rotations, then the 
statistics of point-to-point variations are fully described 



by the set of one-parameter functions known as radial 
covariances. If these functions are measured (using a 
three-dimensional array of sample points) for a specimen 
whose distributions are not invariant under rotations and 
translations (e.g. in samples with aligned crystals or a 
compositional gradient), then these functions will repre- 
sent averages (over direction and position) of covariance 
functions which depend on orientation and absolute po- 
sition in the sample. Of course, if one wishes to infer 
statistical properties of the parent material from a labo- 
ratory sampling, it is necessary (and traditional) to as- 
sume translational invariance in the absence of contrary 
information. Because inferences from samples volume to 
the larger specimen are an important aspect here, a def- 
inition of source material extending only to material in 
which the underlying distributions are "stationary" will 
be implicit in the discussion to follow. Rotational invari- 
ance (or isotropy) for the underlying distributions will 
not be taken for granted, although its assumption will 
be necessary if one wishes to measure the radial covari- 
ances from polished sections which do not sample a wide 
variety of orientations in the specimen. 

Applications for the concept described above are most 
familiar in the field of time series analysis2j4, where the 
onc-dimcnsional independent variable, time, is used in 
place of the three-dimensional variable, position. How- 
ever, the concept itself is quite general, and myriad two 
and three dimensional spatial applications already exist 
in the literature^. In fact, curiousity about the statisti- 
cal properties of composition in rocks was inspired with 
an article by Martin Gardner^ concerning applications to 
natural phenomena ranging from the topology of cratered 
terrain^ to the pattern of notes and rhythm in a piece of 
music^. 

The general covariance function (or cross-covariance) 
of variables A and B evaluated at regions displaced by 
Ax is defined by 

aAB[Ax] = ({A[x + Ax] ~ fiA}{B[x] - fiB})s- (1) 

Conventional notation will be used here: population (or 
"parent specimen" ) averages are denoted by angle brack- 
ets or Greek letters, while sample (i.e. measured) av- 
erages are denoted by a bar over the quantity, or Roman 
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letters. In particular, ^.a and ^,3 are defined as the popu- 
lation averages of A and B respectively, with the averages 
here taken over all possible values of location vector x. 
Note: This quantity is sometimes put into dimensionless 
(correlation coefficient) form by normalizing with stan- 
dard deviations of A and B, as in pab[^x\ = 

For the petrographic application, it is useful to exam- 
ine a mixture of n homogeneous phases. Consider spatial 
heterogeity for the variables A and B which take on val- 
ues of iA and ii?, respectively, in the ith phase, i — l,n. 
If p[i,j,t] is the joint probability that, for two sample 
points separated by a distance (or lag) i, the first will lie 
on a phase i region and the second will lie on a phase j 
region, then the radial covariance function for A and B 
can be written: 



CTABit] = ^^p[i,i,t](iA- ^a)(jB - ^b)- 



(2) 



i=i j=i 



li A = B, then the quantity defined above is referred to 
as the radial auto-covariance for A. 

The aAB [t] constitute a measure of the statistical cor- 
relation between the value of the parameters A and B 
at points in the specimen separated by a distance t. For 
the special case t = 0, the radial covariance cryiB[i] re- 
duces to the covariance for A and B in the specimen, i.e. 
cabIO] = CTAB- Similarly the radial auto-covariance for 
A, crAA[t], reduces to the variance of A in the specimen, 
i.e. crAA[0] = (TA^- Using this notation, the absolute 
value of aAB [t] for the specimen is less than or equal to 
the product of standard deviations aA<^B, and it is equal 
to zero when values of A and B measured at points sep- 
arated by length t are uncorrelated. 

Although the general covariance functions contain all 
information on the statistics of point-to-point variations 
in a specimen, the value of the function at any chosen 
argument depends upon heterogeneities that may have 
many different length scales^. Fortunately, knowledge 
of covariance functions can be converted, without loss 
of detail, to knowledge of spectral densities which serve 
to separate effects due to heterogeneity on various size 
scales. In this way, data obtained by techniques which 
are sensitive to heterogeneity on different size scales can 
be combined to provide a picture of compositional vari- 
ability over a wide range of spatial frequencies. 

If we mimic equation^and define correlation Cab [Ax] 
as the average over all a; of ^ times B at regions displaced 
by Ax, one can quite generally write aAB = CAB—fJ-AfJ-B, 
where the product of the means is a constant with no de- 
pendence on Ax. But the Fourier correlation theoremifi 
says that correlation is the inverse transform of the con- 
jugate product of spectral densities for A and B in the 
specimen, and fj,A and /i b are volume-normalized spectral 
densities measured at zero frequency. Thus the Fourier 
transform of aAB is the conjugate product of volume- 
normalized spectral densities for A and B, with the zero- 
frequency value set to zero. 

To be specific, equation|21defines a function of the mag- 
nitude of the separation between two points in the sam- 



ple, but not of the direction of that separation. Its three- 
dimensional Fourier transform (using signal processing 
conventions) can be written as: 

jabU] ^'^J J sm[27r ft]CAB[t]dt. (3) 


The inverse equation for this expression is: 

OG 

Cab W = 2 y ( sin[27r/i]7^B [f]df. (4) 


For the case t = 0, this yields the relationships: 

oc 00 

Cab[0] = j ^TiflABdf ^ j ATTf-iABd{\nf ). (5) 
—00 

Hence one might relate spectral density physically to 
47r/^7yiB, which from above is that portion of the co- 
variance aAB [0] = Cab [0] — /^aA* b resulting from compo- 
sitional fluctuations occurring with spatial frequency in 
the interval from / to f + df. For a quantity independent 
of the units used to measure frequency, the second equal- 
ity in[Slrelates spectral density to iTrf^jABj that portion 
of the covariance between A and B in a given "e-fold" 
of frequency (or decade of frequency should one further 
multiply by In 10). 

One example is the special case when all variations 
occur with one frequency, fo- Sinusoidal banding of an 
otherwise homogeneous material would, for instance, give 
rise to this condition. Regardless of details of the under- 
lying distribution, however, the radial auto-covariance 
and spectral-density [sans origin) for a variable of this 
type can be written: 



and 



_ 2 sm[2TT fot] 
crAA[t\ = cta „ r , ■ 



1aa[j\ = Ca- 



(6) 



(7) 



II. THE WELL-MIXED MODEL 

Although long-range order, like that described by 
equations El and 13 is certainly possible in geological ma- 
terials (e.g. in compositional zoning of mineral single 
crystals), it is likely to be the exception rather than the 
rule in most aggregate materials. A more likely model is 
one which assumes that the specimen consists of single- 
phase regions whose relative positions are fully random 
(that is, well-mixed) with respect to one another. If the 
regions are atomic dimensions in size, then the model 
describes a glass; if the single-phase regions are consider- 
ably larger, then the model would describe, for example, 
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a well-mixed breccia. In igneous or metamorphosed ma- 
terials, the possibility of compositional correlations be- 
tween adjacent crystals could give rise to deviations from 
this well-mixed case. 

Even a well-mixed model could be quite difficult ana- 
lytically if one wishes to consider the details of the inter- 
play (in three dimensions) between grain shapes, sizes, 
and the packing arrangement. However, considerable 
simplification results if one assumes short range order in 
the sense that lage, which leave their initial grain, neces- 
sarily terminate on points compositionally uncorrelated 
with the starting location. If we define p[i] as the frac- 
tional abundance of phase i points, and ^^[t] as the prob- 
ability that lag t, beginning at an arbitrary location on 
a grain of type i, will not leave that grain before ending, 
then we can write: 



a-AB [t] 



(8) 



and 



lABlf] - ^pM(.A - ^iA)UB - Mi3)S.[/]. (9) 

1=1 

The function Sj [/] is the radial Fourier transform of the 
containment probability [t] , which in turn obeys the sim- 
ple constraint: 

oo 

4^ J ^,[t]t^dt = S,[0] = {Vi),i = l,n. (10) 



Here, (Vi) is the (volume-weighted) average volume for 
grains of type i. 

Calculations for several simple grain geometries have 
been attempted. For spherical grains of diameter D, the 
containment probability isii: 

= { 1 - i + 3 (if ,t<D. (^^x 

^ ,t>D. ^ ^ 

Of course, not only are truly spherical grains rare, but a 
model made up solely of spheres of one size necessarily is 
filled with voids. 

A more realistic model for containment probabilities 
comes from the case of rectangular solids. Unfortunately, 
a full analytical solution has not yet been obtained. The 
case for a cube of side d has been tabulated by a combina- 
tion of analytical and numerical methods. Somewhat by 
accident it was discovered that an RMS deviation from 
the exact values of only 0.0045, over the non-zero range 
< t < VSrf, is obtained with the much simpler approx- 
imation: 



m = { 



(i ^ bdj 

,t>bd 



* r ,t<bd. 



(12) 



Here 6, from the volume constraint 1101 must equal 

1/3 

(1^) . The Fourier transform of the approximation is 



also similar to the exact one, although the low level "rip- 
ples" for values of / large compared to l/d are differently 
placed. These ripples are of course averaged away when 
a continuum of grain sizes are used, and hence should 
have no effects in applications to real materials. Thus 
the approximation given in 1 121 for a cube of side d will be 
adopted as a simple expression for work here. 

The equation |S1 provides a model for mixed-grain ra- 
dial covariances, given the phase abundances p[i] and the 
containment probability function ^i[t] for grains of each 
phase. Note that in this well-mixed case, auto-covariance 
will always be a positive monotone-decreasing function of 
lag which goes to zero for lags large compared to "com- 
positional" grain sizes in the specimen. In particular, if 
the grain geometry (in terms of the containment proba- 
bility function) is the same for all phases, then all of the 
(TAB[i] for the specimen exhibit the same t dependence. 
For such a specimen, knowledge of the t dependence for 
the radial auto-covariance of one variable would spec- 
ify the t dependence for all. Conversely, if the n phases 
obey one of m different containment probability functions 
(m < n) , then measurement of the radial auto-covariance 
for m variables might allow calculation of the contain- 
ment probabilities for each phase type via|H| as well as 
a check of the model by predicting values of the radial 
cross-covariances (i.e. cases in which A B). 

Given that many geological samples can be consid- 
ered neither homogeneous glasses nor well-mixed brec- 
cias, then what deviations from the well-mixed model are 
to be expected? One simple deviation can be detected by 
checking the relationship between physical grain size and 
compositional grain size. If the two do not agree, then 
a more complicated history is required. For example, a 
physical grain size much smaller than the compositional 
grain size might result from a breccia composed of larger 
fragments which were crushed by not well-mixed prior to 
compaction. Conventional petrographic analysis, unsur- 
passed in its ability for qualitative discernment of pat- 
terns requiring knowledge of higher statistical moments, 
would be helpful in verifying the cause even if the effect 
itself required the quantitative approach described here. 

In a wider variety of ways, well-mixed materials can 
be modified by open and closed-system thermodynamic 
processes. If the scale of transport allowed in such pro- 
cesses is large compared to the sample size, especially 
in the presence of a long-range gravitational force, then 
the resulting effects on specimen heterogeneity must be 
handled case by case. But if in-situ transport processes 
are limited to distances smaller than the specimen size, 
then one overall effect of the redistribution of material is 
predictable: regions enriched on one element will be sur- 
rounded by regions depleted in that element. This means 
that the probability of finding such a redistributed ele- 
ment is likely to decrease upon leaving an enriched region, 
prior to leveling off at the average probability. Hence the 
radial auto-covariance for that element is likely to go be- 
low zero before it dies out as t gets larger. In the limit- 
ing case when an element-pure region (phase 1) is small 
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compared to a surrounding element-free region (phase 2), 
then the negative excursion for ctaaM goes down to: 

mm[(TAA[t]] = - ^^^^ 

Here bi is a characteristic dimension of the enriched zone 
and 62 a typical size for the depleted zone. 

The possibly diagnostic nature of such deviations from 
the well-mixed case suggests that measurement of these 
quantities from actual samples, in addition to model- 
ing on the basis of the observation of phase abundances 
and geometries, might be a worthwhile pursuit. This 
is especially true in light of the fact that information 
on point-to-point compositions in geological materials is 
frequently available, and the data on relative locations of 
the sampled points easy to obtain, during the course of 
automated modal analyses. 



III. MEASUREMENT FROM COMPOSITIONAL 
DATA 

Compositional information, as a function of position 
in a specimen, is most conveniently measured on flat 
sections of the specimen. This is true whether compo- 
sitional identifications are made by optical microscopy, 
polished-section energy or wavelength dispersive x-ray 
analysis, or thin-specimen analytical transmission elec- 
tron microscopy. As mentioned at the outset, measure- 
ment of radial covariances from "two-dimensional" sam- 
ples of this sort requires either (1) an assumption that 
the specimen analyzed is isotropic, or (2) averaging of the 
data for sections which sample a wide range of orienta- 
tions in the specimen. A third alternative is to use slices 
through the specimen taken in a why such that the rela- 
tive three-dimensional positions of the sampled locations 
are known. Observations of the chondritic interplanetary 
dust aggregates examined in this thesis, with SEM and 
TEM, provide no indication that the assumption of radial 
isotropy is unwarranted at this point. 

As will be easier to see in examples of the Fourier 
transformed radial covariances, the spatial resolution of 
the sampling method and the maximum separation be- 
tween points considered in the covariance calculation de- 
termine, respectively, the upper and lower bounds (in 
terms of spatial frequency) on the "spectral window" 
through which specimen heterogeneity is being examined. 
In order to avoid aliasing it is useful to set the minimum 
spacing between analysis points (the "lag interval" ) to 
be on the order of (or smaller than) the spatial resolu- 
tion of the point analyses. In order to improve spectral 
resolution and increase the number of effectively inde- 
pendent sample locations, it is important to sample as 
wide a range of lags (point separations) as is possible. In 
meeting these criteria for a fixed number of point anal- 
yses, a simple square array of sampled locations is not 
the optimum configuration, but it is one which will be 



frequently encountered. In the discussion to follow, we 
will assume quite generally an array of M sample loca- 
tions: Ai and Bi are the values measured for A and B at 
the ith sample location (position Xi,i — 1, M). Thus the 
postscripts on A and B refer to sampled locations, not 
to mineral phases. 

Although a number of slightly different "sample" cross- 
covariance functions could be defined, it is simplest in the 
interest of subsequent error analysis to use for and /z_b 
the best available estimates for the mean values of A and 
B in the specimen, and then to calculate: 



SAB[t]^ [T^]T.T.^t^J{A- flA)iBj- fiB). (14) 

Here Snj is simply a delta function used to disallow from 
the double sum all terms except those for which analy- 
sis points i and j are separated by a distance t in the 
specimen, and Mt is just the double sum over that delta 
function (i.e. the number of point pairs in the sample 
separated by a distance t). Of course, any two or three 
dimensional sample configuration is likely to include pairs 
which are not all integral multiples of the (minimum) lag 
interval. At. In order to create a set of equally spaced 
data points for the Fourier transform process, it is useful 
to "widen" Suj to allow all pairs into the sums for t which 
have separations within ^At on either side of t, and then 
to evaluate m for all tj = {j - \)At,j = l,T/At. Here 
T is the maximum lag introduced into the calculation. 
Because of the large uncertainties associated with data 
on lags larger than half of the sample size, T will usually 
be chosen to be less than half of the maximum separation 
between points in the sample. 

As with any estimate of the radial covariance based on 
measurements from a finite sample. il4l provides a biased 
estimate. If we define restricted averages, based only on 
data pairs separated by the distance t, of the form: 

. s A/ M 

then with simple algebra it is easy to verify that: 

{sAB[t]) = (JAB[t] - {{At - lXA){Bt - Mis)). (16) 

If lags of all sizes are uniformly distributed over the sam- 
ple, then the second term in^jwill depend very little on 
t, and be approximately equal to {{A — ^a){B — ^b)) , the 
covariability in estimates of ^la and /is from the sample. 
Thus when A = B, the probable bias in the estimate 1141 
is just the variance in the estimate of fiA, which cannot 
be determined without assumptions about the character 
of the parent specimen (see next section). The amount of 
bias is roughly the same for all t, and hence IT4l provides 
an unbiased estimate for the shape of cryiB[t], but not 
for its absolute position in the vertical direction. On the 
other hand, the bias in estimates of the spectral density 
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is not independent of frequency. Before estimation of the 
spectral density, however, the abrupt cutoff in the radial 
covariance estimate at t = T must be addressed. 

In order to avoid forcing the calculated covariance 
function s^sM abruptly to zero for i > T, it is 
traditional'^ to do this gradually prior to Fourier trans- 
formation by multiplying the measured covariance func- 
tion by a function which goes "gently" from 1 down to 
zero over the interval < i < T. A common choice of 
this function is the Hanning lag window, defined by: 

^[,].{^(l+cos[f]) ,t^T (^^^ 

The finite range of lags allowed into the calculation for 
-SAsii] inevitably decreases the resolution of the mea- 
sured spectral density. Gentle supression of the covari- 
ance function for large lags prior to Fourier transforma- 
tion results in a simple "blurring" of details from the 
spectral density function. Without this "smooth" win- 
dow, the loss of resolution would occur in a more com- 
plicated way. 

If we define the number of terms in the calculation: 
N = T/At; and the discrete frequencies fk = {k — 
i)/(2T),fc = l,iV; then we can write the three dimen- 
sional digital transforms (following |21 and as: 

(At? ^ t 

9AB[fk] = T^^sfel^Iij] sinpvr/fet,], (18) 

1=1 

{or k ~ 1,N, and conversely for j = 1,N: 
2 ^ f 

SAB[t,]D[t,] - ^ E ^9AB[fk] sin[27r/fct,]. (19) 

j=l 3 

To examine bias in the spectral density estimate, the 
discrete transform of ^] can be taken. Again if the lags 
are uniformly distributed over the sample volume, then 
to first order this can be written: 

{9AB[f]) = iAB[f]-{{A~l^A){B-y.B)) 7 sin[2^/t,]. 

(20) 

The summation term in this equation is the discrete ap- 
proximation to a delta function at the origin. That is, it 
is very large for / ~ 0, but very small for other values 
of /. The upshot of this is that the spectral density es- 
timate is likely to be seriously biased due to uncertainty 
in the specimen averages only for / > (1/T). For as- 
pects of the estimates for which bias due to the finite 
sample is not a problem, the analysis of uncertainties is 
straightforward if A and B are assume to be Gaussian 
variates^. As mentioned above, one result of such analy- 
sis is that large uncertainties are associated with data on 
lags larger than half the sample size. These uncertain- 
ties result in large uncertainties for all frequencies in the 
spectral density estimate. Thus making the maximum 
lag allowed into the calculation (T) small compared to 



the sample size has two effects on the spectral density es- 
timates: It decreases spectral resolution but at the same 
time increases the stability of spectral density estimates 
for all frequencies. 



IV. THE STATISTICS OF SAMPLING 

Suppose one measures the variable A at M points on a 
specimen, and wishes to determine an average value for 
A in that specimen. The M analysis "points" constitute 
the sample. Standard practice is to calculate the sample 
mean from the measured values for A: 

M 

^-mE^'- (21) 

This value A constitutes an "unbiased" estimate for the 
specimen (or "population" ) mean /i^ since the expected 
value for this measurement is indeed the population 
avearge, i.e.: 

{A) = MA. (22) 

The object of this section is to provide a prediction 
for the error in this estimate. To see how this error will 
depend on specimen heterogeneity, one need only con- 
sider the case in which all M analysis points are clus- 
tered into a tiny volume much smaller than the grain 
size of a rock. One would hardly expect to learn much 
about the composition of the whole rock from such an 
analysis, regardless of the size of M . On the other hand, 
if the analysis points are randomly spread out over dis- 
tances much larger than the average grain size, then for 
M sufficiently large it should be possible to determine 
the rock composition arbitrarily well. A nice feature of 
the approach adopted here is that the resulting answer 
will require no assumption about the random mixing of 
the constituent grains. It instead follows simply from the 
definition of radial covariance. 

The expected value for variance in the estimate of 
from measurements on the sample of M points is: 

^ M M 

{{A - ^^A)') ^ ^ E E - ^■4)(i3, - Ms)) (23) 
i=i j=i 

where the term in the summations is none other than 
CTAAiiy], i-e. the radial variance for separations Uj be- 
tween ith and jth sample locations. Thus the variance in 
estimates of the mean is just the average of radial auto- 
covariance for data from pairs of points in the sample. In 
a similar fashion, an estimate of the covariance between 
sample estimates of /i^ and /is can be written as: 

M M 

{{A - fiA){B - Mb)) ^ E E ''^bM- (24) 

i=i j=i 
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Thus knowledge of the radial covariance functions for a 
given specimen allows prediction of the uncertainty in es- 
timates of means based on measurements from a limited 
sample. 

The way in which sample size and compositional 
coarseness modifies estimates of uncertainty in the mean 
can be seen more clearly if the "i=j" terms are separated 
from the double sum in eauation l23l One can then write: 



JJ^A > \P\ « 17 
(25) 

where a sample bias coefficient for the sample configu- 
ration has been defined as an average over all pairs of 
sample points by: 



Again, these values equal those for independent sample 
locations if \p\ is sufficiently small. However for a suffi- 
ciently small sampled region, p approaches one and even 
hundreds of analysis points on such a small sample can- 
not result in reasonable uncertainties for our estimate of 
MA- 

For the case when the sampled region is not a set of 
discrete points but a continuous sampled volume, then 
two simplifications ensue. First, the number of analysis 
points M effectively goes to infinity, so that the limiting 
cases of equations 1251 and 1301 applv for which \p\ ^ 
Secondly, equation 1261 can be rewritten: 



47r 
V 



l^E[f]f'df (31) 

O'AA M 



M M 



(26) 



This sample bias coefficient is to first order dependent 
only upon the lag dependence of the covariance function 
for A, and on the sample size. It, like the radial covari- 
ance function, is expected to approach zero for sample 
locations which are widely separated. Hence, for suffi- 
ciently large sample regions, equation [23 predicts that 
sample analysis points can be considered independent of 
one another. By the same token, however, for a sample 
of fixed size, an increase in the number of analysis points 
beyond M ~ |l/p| buys little decrease in uncertainty. 

If one wishes to estimate cr\ from the same limited- 
sample measurement, the increase in uncertainty due to 
a sample which is too localized is even more severe. If 
one measures the sample variance: 



(27) 



then an unbiased estimate for a\ is obtained from the 
relation: 
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M 



M-lJ 1- p 



{ 



Equation 1251 can then be written as: 



M 



(28) 



Here ^[t] is the containment probability defined in section 
im but this time with reference to the sample volume V 
instead of a grain volume. As before, S[/] is the radial 
Fourier transform of ^[t]. When viewed in this light, the 
quantity is just the probability of two points in the 

sample being separated by distance t, per unit volume in- 
crement AiTt^dt. The integration constraint 1101 becomes : 



(32) 



Because of the presence of V in the denominator of equa- 
tion!^ a sample bias coefficient can also be calculated 
for two-dimensional and one-dimensional sample arrays. 

For the purpose of this study, the most welcome facet 
of equation 1311 is that the cube approximation of 1 121 can 
be utilized for both S\t] and for tr^AM/fAAiO], in one case 
with reference to the sample volume and in the second 
case with reference to the specimen grain size, to provide 
a good general purpose estimate of sample bias coefficient 
as a function of sample size D and grain size d: 



l-(f) + f(f)' ■D<d. 
- { (33) 

{if-iit + Hif ^D>d. 



(M-l)(l-p)^'^>' 
which in the limits becomes 

jr^i^A) Jpl«i7 

{{A-pAr) = { 
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APPENDIX A: HETEROGENEITY OF 
INTERPLANETARY DUST 

One of the qualitatively striking features of material 
in chondritic interplanetary dust is the small, submicron 
size scale for chemical heterogeneity^^. This is manifest 
in three ways: i) in the small (e.g. 10-100 nm) size of in- 
dividual mineral grains; ii) in the fact that compositions 
at locations separated by distances as small as 200 nm 
often seem to bear no relationi to one another, and iii) 
by the remarkable observation^'' that, in spite of their 
small size, nanogram specimens which probably sample 
different parent bodies show abundance ratios to silicon 
for 10 major elements which on the average agree within 
40%. 

Chondritic aggregates are not the ideal samples on 
which to begin making quantitative measurements of spa- 
tial heterogeneity, since contiguous sections of the aggre- 
gates available for such measurements are seldom more 
than several microns across. On the other hand, informa- 
tive models of spatial heterogeneity are possible and fur- 
ther testing of those models may be possible with return 
of Stardust mission cometary dust specimens in 2006. It 
is toward these ends that this appendix is designed. In 
the second section of this appendix, it will be shown that 
the small grain size of the chondritic aggregates suggests 
that the 40% spread in element to silicon ratios men- 
tioned above is evidence for significant diversity in the 
compositions of chondritic aggregate source materials, in 
so far as those compositions remain intact in our samples. 



In addition, comparisons with material in meteorites are 
discussed. In this section, the theoretical framework un- 
derlying the basic approach is addressed. 

In discussing chondritic aggregate "source materials" , 
it is important to address the fundamental problem in 
simple terms at the outset. By "source material" we re- 
fer to the parent mass of a particular aggregate. Implicit 
in this is the assumption that the aggregate was at one 
time part of a larger body. The parent mass might, for 
example, be a geological zone in a comet. If comet ex- 
teriors are largely undifferentiated, then the geological 
zone might be an accretionary layer. In any case, it is 
obviously impossible to deduce the properties of a source 
material based on information which pertains only to a 
sample of that material. On the other hand, it is reason- 
able to make inferences based on what is known, even if 
that is very little. 

To illustrate our predicament with respect to chon- 
dritic aggregate materials, suppose someone was to pro- 
vide you with a hand specimen of material, and without 
providing additional information, to ask for your best 
guess as to the nature of the source. If the hand spec- 
imen was a crystal of uniform composition, you might 
guess that the parent material was a collection of similar 
crystals, in the absence of any hint as to other possible 
constituents. But having had some experience with ter- 
restrial rock formations, you might have strong doubts 
that the single crystal was a very representative sample. 
If the hand specimen was instead an aggregate of thou- 
sands of millimeter sized crystals of three distinct miner- 
als in apparently random juxtaposition with no obvious 
gradients or anistropy, you might have considerably more 
confidence in your estimate of source mineralogy, compo- 
sition, and structure from the sample. Of course, in either 
case the hand specimen might have been unrepresenta- 
tive, but given a feel for the random way in which rocks 
are often put together, much more confidence would be 
accorded to inferences from the second type of sample. 
One assumes that the parent material is a simple ran- 
dom mixture of crystals in the absence of evidence to the 
contrary. In the remainder of this section, the implica- 
tions of this assumption will be made quantitative with 
help from results from the paper to which this appendix 
is attached. 

The measure of point-to-point heterogeneity central to 
this argument can be expressed in two complimentary 
forms, as the radial covariance function cTAsi^] or its 3-D 
Fourier transform, the spectral density 7ab[/]- Here A 
and B are both composition variables in the specimen 
(e.g. volume fraction of olivine, mass fraction of silicon, 
etc.) When A and B represent the same variable, we 
often prefer displaying spectral density as the decompo- 
sition of standard deviation per decade of frequency (or 
size-scale). 

When a variety of structures is present in a specimen, 
one of course formally should Fourier transform the whole 
spatial array, thus in effect adding complex Fourier co- 
efficients from each structure. However, when one can 
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FIG. 1: Root mean square compositional variability per 
decade of size-scale (or frequency) for log-normally distributed 
single-phase cubes one micron on a side, and spheres one mi- 
cron in diameter. 




FIG. 2: Root mean square compositional variability per 
decade of size-scale (or frequency) for log-normally distributed 
single-phase spheres with three different mean diameters and 
diameter standard deviations. 



treat the structures as essentially uncorrelated in posi- 
tion with respect to one another (i.e. as ideally random 
in distribution), it is convenient instead to simply "add 
Fourier intensities (i.e. squared amplitudes)" from each 
object type. 

Thus for a random mixture of cubic grains of side d 
which have various values for A and B, it follows from 
eauation ll2l in section^that these functions are approx- 
imately: 

,^ ,^^s[0](l-^)' ,t<bd 

CTAB[t] = { (Al) 

,t>bd 



and 



2 + cos[27r/6d] - 3 sin[27r/6d] 
2T:^f%d 



aAB[0\. (A2) 



1 /3 

Here 6 = (|^) , and ctasP] is the covariance of A 
and B in the specimen. Similar expressions follow for a 
random mixture of spherical grains from equation II II 

For specimens composed of pure phase {A = 1) grains, 
10% abundant by volume in an ^ = matrix, the result- 
ing standard deviation decompositions for collections of 
lognormally distributed grains are illustrated in Figures 
Hand 12 In particular. Figure ^ illustrates the effect of 
grain shape differences (cubes versus spheres) for a rel- 
atively tight distribution of grain sizes (size or diameter 
mean of l/iw and standard deviation of 0.1/xm. Tighter 
size distributions (i.e. smaller ratios of standard devia- 
tion to mean size) result in even more oscillations in the 
high frequency (small size-scale) sides of the peak. 

Figure|21illustrates the effect of changing grain size and 
standard deviation. Note that the effects of grain shape 
(e.g. Fourier ringing) are washed out as the range of sizes 
broadens. We also expect this to be the case as individ- 
ual grain symmetries decrease. Of course, correlations 



in grain position or orientation might add new structure 
(e.g. lattice periodicities) to such heterogeneity spectra. 

Our experience with fine-grained interplanetary dust 
particles (typical sizes S/^m in diameter) suggest that 
the porous or "re-entrant" material has a mean size for 
monomineralic grains (i.e. olivines, pyroxenes, sulfides) 
in the 0.1/im size range^^. However, particles with multi- 
micron sized monomineralic (e.g. olivine) grains with 
attached fine-grained material are also found. Thus for 
example a well-mixed lognormal grain size distribution 
with mean near 0.2fj,m but a much larger standard devi- 
ation may be appropriate for some types of interplanetary 
dust. The well-mixed model could be tested on micro- 
tomed dust particles not available at the time of the orig- 
inal paper, as well as against fine-grained carbonaceous 
meteorite matrices which may prove to be composition- 
ally much coarser. 

The tools described in this paper further allow one to 
model the relationship between individual dust particles 
and their "parent bodies" . For example, if the size and 
shape of individual grains is independent of composition, 
the correlation coefficient of a given particle to it's source 
might be writteri 



Pnet 



(A3) 



Here pi is the volume fraction of grains of size di . More 
careful work on this question is likely worthwhile, par- 
ticularly in anticipation of the Stardust comet sample 
return mission in 2006, at which time multiple parti- 
cles from a single identified cometary source will become 
available. 
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FIG. 3: Root mean square roughness per decade of size-scale 
(or frequency) for log-normally distributed gaussian bumps 
three different mean diameters and diameter standard devia- 
tions. 



APPENDIX B: LOG-LOG SCALE ROUGHNESS 
SPECTROSCOPY 



Thus once again one might relate spectral density physi- 
cally via a product (here to that portion of the 
covariance itab[0] = Cab[0] — fiAfJ-B between A and B 
in a given "e-fold" of frequency (or decade of frequency 
should one further multiply by In 10). 

Consider for example a Gaussian bump, of half-width 
d at half of its maximum height A. If this bump is 
centered at the origin, it's height as a function of po- 
sition is h\t] = y 4cxp[— TQs-j. The 2D Fourier transform 
of this fromEUis H[f] = 2'K(fAeiip[-2TT^pd^]. Mov- 
ing the bump from the origin will affect the transform's 
phase, but not the spectral density obtained by squaring 
it's amplitude: ^hh[f] = Air'^d'^A'^ expl-Air^pd'^]. In- 
verse transforming this gives a height autocorrelation of 
C;i;i[t] = Trd^A^ exp[— which becomes the autoco- 
variance cr;ih[i] as well if indeed the spatial region inte- 
grated over is large enough to render the average height 
essentially zero. 

Note in particular the "line of hemispheres" in Fig. O 
Crossing above this basically signals the existence of ob- 
jects on a given width scale which are taller than they are 
wide. Grassy plains might represent an extreme in this 
regard. At the other extreme (exceptional flatness), sili- 
con wafers immediately after growth of an epitaxial layer, 
with O.lSnm dimer row steps sometimes separated by 
a micron or more laterally represent the other extreme. 
We anticipate providing some experimental examples of 
these extremes, and further application examples, in a 
2nd revision. 



Here we turn from studies of compositional heterogene- 
ity in three dimensions to the study of height variations 
across a surface in two dimensions^'^'"'^^. The transform 
equation (e.g. ^ in two dimensions is: 



lABlf] = 27T / tJo[2Trft]CAB[t]dt, 



(Bl) 



APPENDIX C: 



LATTICE FRINGE COVARIANCE 
FINGERPRINTS 



where Jq is the 0th order Bessel function of the first kind. 
As with the inverse in three dimensions (equation ^ , 
the reverse transform in two dimensions is the forward 
transform, with t and / exchanged. The corresponding 
t = relationship in 2D then becomes: 

OO CXD 

Cab[0]^ J 27r fjABdf= J 2^/Vsd(ln/). (B2) 



In this appendix, covariances in high energy electron 
intensity as a function of scattering angle are examined 
as a tool for fingerprinting nanocrystal assemblages^^, 
given one or more high resolution lattice fringe images 
to work with. Calculations of these for some common 
nanoparticle structures are underway, and an expansion 
of this discussion is anticipated in a 2nd revision as well. 



